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Abstract 


The  connection  between  the  Baker-Hausdorff  formula  and  systems 
of  linear  differentiail  equations  is  discussed.     On  the  algebraic  side,  a 
proof  for  a  theorem  by  K.  0.  Fi*iedrichs  is  given  which  characterizes  Lie- 
elements  in  a  free  non-commutative  ring.       On  the  analytic  side,   criteria 
for  the  existence  of  solutions  of  systems  of  linear  differential  equations 
in  terms  of  a  finite  nxunber  of  quadratures  are  derived.     The  range  of  validity 
for  the  representation  of  the   solutions  in  the  form  of  an  exponential  function 
of  a  matrix  is  discussed.     The  case  corresponding  to  a  second-order  differen- 
tial equation  is  studied  in  detail. 
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Introduction,     ^e  natural  generalization  of  the  first-order  linear 
differential  equation 

(1)  §  =  a(t)y 

is  a  system  of  n  linear  differential  equations 

(2)  ^  =   J:  a^^^(t)y^;  (v  -  1,2, .. .,n) 

which  is  characterized  by  a  matrix  A(t)  vrith  elements  a         which  are  functions 
of  t.     This  system  is  equivalent  to  a  differential  equation  for  a  matrix  I: 

(3)  S-A(t)Y. 

The  columns  of  Y  form  a  set  of  n  linear  independent  solutions  of  (3)*  i.e., 
a  set  of  vectors  with  components  y  satisfying  (2),  Equation  (3)  may  be 
interpreted  in  an  even  more  general  way  if  we  assame  that  I  and  A  are  linear 
operators  in  a  Hilbert  space  which  depend  on  a  parameter  t  in  a  manner  which 
allows  differentiation  and  integration  with  respect  to  t. 

The  importance  of  equations  of  type  (3)   for  various  brsmches  of 
applied  mathematics  and  theoretical  physics  is  well  known.  The  present  in- 
vestigation has  been  stimulated  by  two  recent  papers.  In  the  first,  K.  0. 
Friedriche  [l]  arrived  at  some  purely  algebraic  problems  in  connection  with 
the  theory  of  linear  operators  in  quantum  mechanics.  The  answer  to  one  of 
these  problems  turns  out  to  be  closely  connected  with  the  so-called  Baker- 
Hausdorff  formula  which  is  for  (3)  the  proper  generalization  of  the  well- 
known  solution  of  (l).  In  another  connection,  H.  B.  Keller  and  J.  B.  Keller 
[2]  resumed  and  continued  the  work  by  H.  F.  Baker  [3]  on  systems  of  ordinary 
linear  differential  equations.  In  the  present  report  their  results  are  sup- 
plemented by  a  discussion  of  certain  cases  where  an  integration  of  a  system 
by  a  finite  number  of  quadratures  is  possible  but  where  the  matrix  A  in  (3) 
cannot  be  diagonalized. 

Algebraic  questions  arise  in  connection  with  (3)  for  the  following 
reason:  If  it  should  happen  that  for  any  values  of  s,t,  the  "commutator" 
(U)  Aj^  '   A(s)A(t)-A(t)A(s)  -  0, 
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then  indeed  (3)  would  have  the  solution 

t 

(5)  Y  -  exp /■  A('r)dT  , 

^  o 

where  the  exponential  function  of  a  matrix  is  explained  by  the  power  series  and 
where  the  integral  of  a  matrix  is  obtained  by  integrating  every  element  of  it. 
Since  (Ii)  will  not  be  satisfied  except  for  particular  cases,  one  may  expect  to 
be  able  to  write  the  solution  as 

(6)  I  -  expil  (t) 

•vrtiere  A   can  be  expressed  in  terms  of  yAdt   and  integrals  which  involve  A  , 
and  commutators  derived  from  A ,    and  A;   the  commutators  are  derived  by  repeti- 
tions of  the  process  which  leads  from  A  to  A-,  .     A  formula  of  this  kind  indeed 
exists  and  it  is  based  upon  the  addition  theorem  of  the  exponential  function 
for  non-connraitative  variables.     This  theorem,  discovered  by  Baker [U] and  Haus- 
dorff£5],is  a  formula  referring  to  an  abstract  ring  and  hence  it  does  not  in- 
volve questions  of  convergence.     Therefore,   a  formula  of  type   (6)   can  be  es- 
tablished in  a  formal  way;     however,  its  range  of  validity  must  then  be  investi- 
gated separately. 

Feynman[63,  using  a  symbolic  interpretation  for  Eq.   (5),  has  derived 
the  solution  of  (3)  in  the  same  infinite  series  form  obtained  when  (3)   is  solved 
by  iteration. 

The  first  part  of  this  report  deals  with  algebraic  problems.     Section  I 
contains  the  necessary  definitions  and  restatements  of  basic  results.     Section  II 
contains  a  proof  of  Friedrich's  theorem  which  chcu:acteri2ES  Lie-elements  of  a  free 
non-commutative  ring  in  a  simple  manner.     In  Section   m   the  continuous  analogue 
of  the  Baker-Hausdorfr  formula  is  proved  and  a  slightly  simplified  derivation  of 
the  fonmila  itself  is  given.     Section  IV  contains  a  method  of  computing  the  terms 
of  Zassenhaus'  formula  which  is  in  some  respects  the  inverse  of  the  Baker- 
Hausdorff  formula. 

In  the  second  part  of  this  report  the  results  of  the  first  part  are 
applied  to  systems  (3)   of  linear  differential  equations.     Section  V  contains 
a  discussion  of  the  case  where  the  infinite  series,  which  constitutes  the  con- 
tinuous analogue  of  the  Baker-Hausdorff  fonnula,    terminates  after  a  finite 
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number  of  terms.  In  this  case,  (3)  can  be  integrated  by  a  finite  number  of 
quadratures i  this  is  not  explicitly  clear  from  a  consideration  of  the  matri- 
zant  (i.e.  the  solution  obtained  by  iteration).  It  is  also  shown  that  the 
conditions  for  the  termination  of  the  series  are  weaker  than  one  might  ex- 
pect them  to  be.  For  instance,  (5)  is  a  solution  of  (3)  if 

(7)  ^^^^  /  A(s)ds  -ij    A(s)dsJ  A(t)  s  0, 

and  (7)  does  not  imply  (U),   at  least  not  in  the  case  of  a  matrix  A    which  is 
not  analytic  in  t.     In  Section  VI  the  possibility  of  solving  (3)  in  terms  of 
anil  defined  by  (6)  is  discussed.     The  following  result  will  be  proved: 

Let  il  ^  be  a  matrix  in  the  space  of  the  n     complex  variables  vhich  consti- 
tute^ the  general  matrix   A  .     J- hen  a  full  neighborhood  of  the  matrix   O 

o 

will  be  mapped  upon  a  full  neighborhood  of  I   =  e:q)  Ci       if  and  only  if  the 

difference  betvreen  any  two  eigenvalues  is  not  2niii  ,  n  »  1,2,3,    ...    ,  n  ^  0. 
From  this  it  follows  that,   in  general,   a  dif fere nti able  XI  such  that  (6)  is  a 
solution  of  (3)  will  exist  only  in  the  small.     The  situation  is  investigated 
more  closely  in  Section  VII,  where  the  equation  (3)   is  interpreted  as  corres- 
ponding to  a  second-order  differential  equation  y     +Qy  =  0,   Q>0. 
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First  Part;  Formal  Algebra 

I,  Preliminaries 

A  free  associative  ring  K  vdth  n  free  generators  x, , . . . ,x  and 
an  identity  1  will  be  defined  by  the  following  four  axioms: 

a.  The  elements  of  a  field  f  of  characteristic  zero  (for  instance 

the  field  of  real  numbers)  are  in  R  ,  and  the  unit  element  1  of  f  is  the 

o'  o 

identity  of  R  .  The  field  f  belongs  to  the  center  of  K  ,  i.e.,  all  its 
o  o  °  o  ' 

elements  commute  with  all  the  elements  of  R.  We  shall  call  f  the  field 

0     

of  the  coefficients. 

b.  The  addition  in  R  is  comntutative  and  associative,  and  the 

o 

multiplication  is  associative, 

c.  There  exist  no  relations  between  the  elements  of  R  except 
those  which  follow  from  a  and  b. 

d.  Every  element  of  R  can  be  obtained  from  the  elements  of  f 

"'  o  o 

and  from  x.,  ...,  x  by  carrying  out  a  finite  number  of  additions  and  mul- 
tiplications. 

It  follows  from  the  axioms  a  -  d  that  the  identity  1       and  the  products 
of  any  number  of   "factors"  x°  ,   i.e.,  the  products 


a,      a^  a 


V 

a 
m 


(1)  x^     x^    .  .    .  x^       ,  (a-j^jOg,  .  .    •  >  %  =  1>2,3,  .  .    .)j 

12  m 

with 

(2)  v^  /  v^  /  v^   .    .    .  v^_^  '^  \  »  V  '  '^'^'  '  '    •  "  ^°^  ^^  "  1>  •  •    '>  ^, 
form  a  basis  of  linearly  independent  elements  of  R     with  respect  to  f   . 

We  may  extend  R     to  a  ring  R,  which  consists  of  all  the  formal  power 
series  with  coefficients  c   ,c(v-,  ...,v,a-,,...,a)inf,  the  generic 

element  A  of  R  being 


(3)     A  -  c^  +  XI  c(v^,    .    .  .  ,Vj^;  a^,  .  .    ,,cj 


1  m 
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i'he  sum  in  (3)  is  taken  over  all  possible  combinations  of  integers  v,  ,...,v  , 

a,,.., a     satisfying  (l)  and  (2)  for  m  «  1,2,3,...    .  A  power  series  of  the  type 
described  by  (3)  will  also  be  called  a  function  of  x,,...,x     and  will  be  de- 
noted by  F(x-.,...,x  ).     Problems  of  convergence  do  not  play  a  role;     if,  for 

instance,   f     is  the  field  of  rational  numbers,  both 
'     o  ' 

00 

Unl  x{ 

n=0 

and  its  square 

00         n 
fZ        HrKn-r)!     x" 

n=0       r=0 

belong  to  K.     Here  we  have  used  the  natural  notation  according  to  which  x     «  1 
for  any  v. 

As  an  example  which  will  be  used  later  we  may  consider  the  case  of 
two  free  generators  x,y;  we  take  for  f     the  field  of  real  numbers.     The  ex- 
ponential function  is  defined  by 

(U)  e^  -    Yi    x^/nl 

n=0 

and  we  have 

(5)  e^^e^  =    ^      x"y'"/(nlml). 

n,m=o 

We  can  find  a  fxinction  z  of  x  and  y   such  that 

(6)  e^  e^  =  e* 
and 

(7)  z   -  u  -  I  u^  +  i  u^  T    .    .    .    , 

where 

(8)  u  -  (e^e^  -l).x-.y+    |-+_^.y_....^ 

If  we  substitute   for  u  its  value  from  (8),  it  is  easily  shown  that  the  series 
in  (7)  leads  to  an  element  of  R.     For  this  pui^pose,  we  shall  call 

a,    +  a_  +    . .  +  a 
12  n 
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the  degree  (in  x^,...x  )  of  the  basic  element  (l);     in  (5)  the  terms  of 
degree  ^   (in  x,y)  will  then  consist  of  the  sura 


X     y 


(9)  ^  E  (J) --"•-'■"■ 

^'    r-0 

Only  a  finite  number  of  powers  of  u  will  involve  terms  of  a  given  degree 
in  x,y,  since  the  terms  of  lowest  degree  in  u*^  are  of  degree  k  in  x  and  y. 

Therefore,  z  becomes  a  power  series  in  x,y  with  rational  coef- 
ficients, the  first  terms  being 

(10)  z  »  x+y  +  ^  xy  -  ■^  yx  +  .  .  .   . 

Baker [U]    and  Hausdorff  [5]   proved  a  theorem  about  the   terms  of 
higher  degree  in  the  power  series   (10)  for  z;     we  may  state  this  theorem  as 
follows: 

Let  u,v  be  any  elements  of  H.     Then  the  bracket-product  or  Lie- 
product        t^jv]   of  u  and  v  is  defined  by 

(11)  [u,v]   »  uv-vu  . 

Using  Lie -multiplication,  we  can  define  recursively  a  Lie-slement  of  R.     We 
shall  call  x  and  y  (or,  in  the  general  case,   the  free  generators  x^)  Lie- 
elements  of  degree  one.     Any  linear  combination  of  Lie-elements  of  degree  one 
with  coefficients  from  f     and  any  Lie-product  of  Lie-elements  shall  also  be 
called  a  Lie-eleraent,     The  total  set  of  Lie-elements  obtained  in  this  manner 
will  be  calledA.     For  properties  of  this  set  aBe[7],[8j,  and  [9j.     In  the 
case  of  two  free  generators  x,y  the  general  Lie-element  involving  terms  of 
a  degree  ^  3  is 

(12)  c^x  +  Cgy  ♦  c^^{xy~jx)  +  c^^-j^  Uxy'yx)x  -  x(xy-yx)  S 

where  c, ,...,c,pp  are  elements  of  the  field  f^  (e.g.  rational  numbers).     The 

result  of  Baker  [U]  and  Hausdorff  [S]  is  that  z  in  (lO)  is  a  Lie-element; 
also,   a  method  of  computing  z  up  to  terms  of  any  fixed  degree  is  given  in 
their  papers.     Formula  (6),  with  z  givei.  by  equation  (lO),   is  called  the 
Baker-Hausdorff  formula.     The  connections  between  this  formula  and  the   theory 
of  differential  operators  will  be  investigated  in  the  following  sections. 
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II.  A    Theorem  of  Friedrichs 

In  a  discussion  of  the  theory  of  operators  of  quantum  mechanics, 
Friedrichs  [ij  found  a  characterization  of  Lie-elements  and  proved  it  in  a 
particular  case  by  using  a  theory  of  representation  of  these  operators.     We 
shall  formulate  and  prove  Friedrichs'  theorem  in  the  case  where  the  ring  R 
has  two  free  generators  x,y.     But  the  proof  for  a  frea  ring  with  any  number 
of  generators  is  almost  literally  the   same . 

We  constmict  first   an  isomorphic  replica  R     of  R  which  has  two  free 
generators  x  ,y    .     Then  we  construct  the  direct  product  h  of  K  and  R   ,  iden- 
tifying the  elements  of  the  underlying  isomorphic  fields  of  coefficients  in 
R  and  R' .     The  ring  R  is  generated  by  x,y,x',y',  where  the  generators  are  not 
entirely  free  but  satisfy  the  relation 

(1)  XX'   *  x'x  ,  yy'   =  y'y,     xy«   =  y'x,  x'y  «»  yx'  . 

We  may  consider  any  element  of  xt  as  an  element  of  R,  which  contadns  R.  Now 
we  can  state  Friedrichs'  theorem: 

Theorem  Ii 

Let  F(x,y)  be  a  function  of  x  and  y.     Vnen  F  is  a  Lie-element  of  R 
if  and  only  if 

(2)  F(x+x',  y+y')  -  F(x,y)   +  F(x',y')    . 

To  prove  this  theorem,  we  may  confine  ourselves  to  the  case  where  F 
is  homogeneous  in  x,y,  that  is,  where  F  is  a  sum  of  terms  of  a  fixed  degree 
It  is  easily   seen  that    (2)   holds  if  F  is   a  Lie-element.     In  this  case,  F  must 
be   a  sum  of  terms  which  have  been  derived  from  the  generators  by  a  repeated 
application  of  Lie -multiplication.     Consider  a  term  w  involved  in  F,  which 
has  been  obtained  from  two  Lie-elements  u  and  v  of  R  by  Lie-multipldcation: 

w  •»  fujv]]   . 

Then  u  and  v  are  of  lesser  degree  than  w  and  we  may  assume  that,  as  functions 
of  X  and  y,  they  satisfy  (2).  bince  it  follows  from  (l)  that  the  bracket  pro- 
duct of  a  factor  depending  on  x,y  and  a  factor  depending  on  x',y'  is  always 
zero,  we  find  that  w  also  satisfies  (2). 

To  show  that  only  Lie-elements  satisfy  (2)  we  introduce  first  the 
following  definition: 

*  A  proof  different  from  the  one  given  here  and  based  on  a  lemma  due  to 
Birkhoff  and  Witt  []8j  has  been  communicated  to  the  author  by  Dr.  P.  M. 
Cohn  of  Manchester  University. 
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Any  r  elements  (r«l, 2, 3, . . .)  of  R  are  called  algebraically  independ- 
ent  if  they  generate  a  subring  of  R  which  is  isomorphic  with  a  free  ring  of 
r  free  generators.  More  specifically,  let  u, ...,u  be  the  r  elements  of  R 
and  let  yi,...,y  be  free  generators  of  a  free  ring  R*.  Then  we  shall  call 
the  u  ,(p»l,  ...,r)  algebraically  independent  if  the  mapping 

y  ->u        (p  =  l,...,r) 

determines  an  isomorphic   (one-to-one)   correspondence  between  R*  and  the 
siTiallest  subring  of  R  which  contains  the  u   .     We  define  the  maps  of  sums 
and  products  in  the  natural  manner.) 

Next,  we  need  the  following  two  lemmas,  which  have  been  proved 
else^ere   ( see  jj]  ): 

Lemma  1.     Let  u, ,.,.,u     and  v  be  r+1  algebraically  independent  ele- 
1  r 

ments  of  a  free  ring  R*.     For  Jl  =  1,2,3,...  and  p  =  l,2,...r,  let 

(U)  u(°)  -  u^  . 

The  right-hand  side  in  (3)  is  the  result  of  an  -^ -f old  bracket-raultiplica- 
tion  of  u    by  V  from  the  right.     Then  all  the  u'     are  algebraically  independ- 
ent (for/  =0,1,2,...)    . 

Lemma  2.     Every  function  H(u, , ...,u^,v)  can  be  written  in  a  unique 
way  in  the  form 

(5)  H  -  H^  +  H^v+  HgV^  +    ..., 

where  H  ,A-.,]i^,    ...   are  functions  of  the  u       ,   ( /«0,1,2, . .. )    . 

We  shall  apply  these  lemmas  to  the  proof  of  Friedrichs •  theorem. 
Let  F(x,y)  be  a  function  which  satisfies  (2)  and  is  of  degree  d  in  x,y.  F  is 
then  expressed  in  terms  of  Lie-elements  of  the  first  degree. 

Assume  that  F  could  also  be  written  as  a  function  H(u, , . . .,u^,v)  of 
algebraically  independent  Lie-elements  of  degrees 

i  1  ^    ^2  "    •    •  =    ^  =    -^o  » 
where  I       is  the  degree  of  v  in  x,y.     We  shall  derive  the  fact  that  then  F 
can  also  be  written  as  a  function  of  algebraically  independent  Lie-elements 

u^^  of  degree    i      +tl    .and  once  this  has  been  established  it  is  clear  that 

F  must  be   a  Lie-element.     From  Lemma  2  we   see  first  that  the  degree  d  of  F 


-  10  - 

is  not  less  than  the  value  of  the  largest  of  the  numbers   i.^,,..,  1  ^,    l^  and 
second,  that  the  transition  from  the  set  of  variables  u^  and  v  to  the   set 

u^    '  eliminates  one  of  the  variables  of  lowest  degree -£^.     After  a  finite 
r  o 

number  of  transitions  of  this  type,  we  will  arrive  at  a  set  of  variables 
which  are  Lie-elements  and  none  of  which  is  of  a  degree  less  than  d.     Then 
F  must  be  a  linear  combination  of  these  variables,  liiich  proves  that  it  must 
be  a  Lie-element. 

Therefore,  only  the  following  remains  to  be  shown:     If,  in  terms  of 
the  Lie-elements  u,,...,u  ,   v, 

(6)  F(x,y)  »  H(u^,...,u^,  v), 
and  if  F  has  the  property  (2),  then 

(7)  F(x,y)  -  H^(u^^^)    . 

Since  the    "variables"  u-,,..,u   ,  v  are  Lie-elements  of  the  ring  R  generated  by 

1  r 

x,y,  we  have 

(8)  w(x+x',  y+y')   »  w(x,y)   +  w(x',y')    » 

where  w  stands  for  any  one  of  the  elements  u-,...,u   ,  v  .     We  shall  write  w' 
for  w(x',y')   and,  correspondingly,  u',v'  for  u  (x',y'),  v(x«,y').     Now  we 
have  from  (2),   (6)   and   (8) 

(9)  F(x+x',  y+y')  -  H(u^+u^'  ,...,u^+iy  ,v+v'  )    . 
Applying  Lemma  2  to  H,  we  find: 

(10)  H(u^+  u^,...,u^+  u^,  v+v  ) 

.H(u">.u("').H,(u(^>.u("')(vV) 
op  p  1     P  P 

.H^(u«>.  u«'')(vV)^...   , 


1     are  deri'^^'^  fr^rsm   tvio  n^  \ 
P 

Now  condition  (2)  gives 


where  the  u^^^  are  derived  from  the  uj'^by  the  transition  from  x,y  to  x',y'  • 
P  P 


-  u  - 


(11)  H(u^+  u^  ,...,u^+  u^,  v+v  ) 

«  H(u^,...,Uj.,v)  +  H(u^,..,,u^'  ,v  ) 


I   I. 


*V>'^«)),*H^(u(*'')v' 

.H,(u«>)v2.H,(u<«')y'2 

2   p  2  p 

k   k* 
According  to  Lemma  2,  the  coefficients  of  the  products  v  v'   on  the  right- 
hand  sides  of  (10)  and  (ll)  must  be  the  same.  From  this  we  have 

(13)       «>(«).«(«■).  hX«)*hX*)') 

(13)        H,(u<«.u^«>').hX^))-Vu(^)') 
(U)       H3(u(^'.  u(«').H^(u«))-H,(u«'') 


From  equations  (13)  and  (lU)  we  may  conclude  that  H,,H^,  ...  vanish  identically  in 
the  ur^  J  and  equation  (12)  verifies  that  H  has  the  same  property  as  the  proper- 
ties of  F  from  which  we  started.  This  finishes  the  proof  of  Theorem  I.  It  has 
been  used  by  Friedrichs  to  prove  the  following 

Theorem  II  .  Let  x,y  be  free  generators  of  an  associative  ring  R.  Let  z 
and  w  be  the  elements  defined  by 

(15)  e^  e^  -  e^ 

(16)  e'  ye  =  w  . 

Then  z  and  w  are  Lie-elements  in  R. 

I  I 

X'^X  X    X 

The  proof  follows  immediately  from  Theorem  I  if  we  observe  that  e         ■  e • e 
and     XX'   =  x'x.     Baker [[UJ   and  Hausdorff  [^5] proved  Theorem  II  by  a  recursive 
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construction  of  z  and  by  an  explicit  formula  for  w.  Once  one  knows  that  w  is 
a  Lie-element,  it  is  easy  to  determine  it  explicitly.  Since  it  is  of  first 
degree  in  y,  it  must  be  expressible  in  terras  of  the  Lie-elements 

(17)  {y»*^}=  [..[[y,x]x]...x] 

which  are  obtained  from  y  by  an  ^-fold  bracket  multiplication  by  x.  The  result  is 

(18)  V   .^f  ^{^.x*}, 

where  ■ly,x''' l  stands  for  y  itself. 


M 


III.  Differentiation  and  Differential  Operators 

Let  R  be  a  free  associative  ring  with  free  generators  x,y,x,,y^... 
Let  F(x,y)  be  an  element  of  R  and  let  X  be  a  parameter,  i.e.,  an  arbitrary  number 
from  the  field  f  (for  instance  an  arbitrary  real  number).  -Chen  we  can  expand 
F(x+  Xx,,y)  in  a  series  of  powers  of  \  : 

(1)  F(x+\x^  ,  y)  -  F(x,y)  +  XF^(x,x^,y)  +  ...  . 
We  shall  call  the  coefficient  of  X  a  derivative  and  write 

(2)  F^(x,x^,y)  '   i^^   )F(x,y)  . 

The  word   "derivative"  was  introduced  by  Hausdorff .     Another  more  customary  notation 
for  this  coefficient  is    "polar",       and  the  process  by  which  F,    is  obtained  from 
F  is  also   called   "polarization".     The  definition  of  a  derivrtive  used  here  is 
different  from  the  one  used  by  Falk   [lO} .     If  F  is  a  monomial  (that  is,   an  ele- 
ment of  the  type   (I.l)),  polarization  consists  of  first  replacing  one  factor  x 
by  x-   in  every  possible  way  and  then  adding  all  the  resulting  terms  afterwards. 
The  polar  of  a  sum  of  terms  is  the  sum  of  the  polais  of  the  terras.     If  F  does  not 

involve  x  ^its  polar  with  respect  to  x     is  zero.     It  is  not  necessary  that  x.   be 

9  3 

different  from  xj     we  can  define  (xt-)F  by  fonning  (x,-^)F  and  substituting  x  for 

x^  after  the  polarization.     We  may  also  substitute  any  element  u  of  R  for  x, 

after  polarisation  and  ^d.ll  denote  the  result  by  (u  -|-)  F. 

dx 
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We  shall  need  the  following  formulas  which  are  due  to  Hausdorff  \S\i 


C3) 


=    y  +  5T  [y>x]  +  jr  [  [7»x]  x]  +  jjj  |^[  [y,x]  xjx      +   ...  . 


Since  we  shall  need  repeated  bracket  multiplication  in  various  formulas,  we 
shall  simplify  the  notation  by  writing 

(U)  |y,  P(x)j  =  p^y  +  P-L[y,xj  +  P2  [  [y»x]  x]  +  ... 

if  ^ 

2 

(5)  P(x)     -  Pq  +  p^x  +  pg  X 

is  a  power  series  in  x.  We  may  then  write  the  equations  (3)  in  the  form 


-X 


(6) 


/   3  \  3 


e  -1 


((y^)e==)e--f7, 


1-e 


-X 


If  P(x)  has  a  constant  term  p  /  0,  there  exists  a  power  series 
Q(x)  such  that 

(7)  P(x)  Q(x)  -  1  . 

We  have  then,  according  to  Hausdorff  |_5J  s 
Lemma  3  :  Let  P(x),  Q(x)  satisfy  (7)  and  let 

(8)  |y,  P(x)]=  u. 

Then 

(9)  y  -  (u,Q(x)  J  . 

The  proof  consists  of  a  simple  straightforward  computation,  using  the  relations 
between  the  coefficients  of  P(x)  and  Q(x)  which  result  from  (7). 
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We  sha3J.  consider  now  differentiable  functions  of  a  real  parameter 
t  in  an  associative  ring  R.  Examples  of  functions  of  this  type  are  given  by 
finite  or  infinite  matrices  with  real  or  complex  elements  which  are  differen- 
tiable functions  of  t.  For  our  purposes,  two  types  of  properties  of  these 
differentiable  functions  are  required.  As  formal  properties  we  need  these: 

(a).  A(t)  is  an  element  of  R  for  all  real  values  of  t.  For  any 
sufficiently  small  e  , 

(10)         A(t+  e)  -  A(t)  +  6  A^(t)  +  e^  AgCt)  +  ...  , 

>^ere  the  terms  on  the  right-hand  side  of  (10)  are  in  R  and  where  their  sum 
shall  be  defined  in  R.     We  shall  define     ^  to  be  A^(t)    . 

(b).  The  formal  laws  of  differentiation  for  a  sxun  and  a  product  hold. 

(c).  If  P(x)   is  a  power  series  in  x  and  if  P(A(t))  «  B(t)  exists  in  R, 
then 

(")       f '  (^  fe  '  f(^>  ■  y  ■  i  ■  ==  ■  *<''  • 

(d).  Given  A(t),  there  exists  in  R  a  uniquely  determined  function  A  (t) 
such  that 

(12)      ^   -  A(t)  ,  A*(0)  -  0  . 

We  shall  write 

t 

o 

A  second  type  of  property,  which  is  more  difficult  to  describe,   concerns  prob- 
lems of  convergence,     ^or  the  present  purposes  it  will  be  necessary  to  assume 
that  exp  A  exists  for  all  functions  under  consideration  and   is  differentiable 
in  the   sense  that   (13)  exists.     Also,  it  will  be  necessary  to  assume  that  cer- 
tain repeated  integrals   (where  integration  is  defined  by  (12), (13)    )  exist. 


(13)  /    A(r)dr    -  A*(t)    . 
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It  may  suffice  here  to  make  two  remarks.  Firstly,  the  definition  of  a  free 
ring  R  is  already  based  implicitly  on  an  assTimption  about  convergence  in  the 
sense  that  any  sequence  u  (n  =  1,2,3,...)  of  elements  of  R  has  the  limit 
zero  if  minimum  degrees  \i    of  terms  involved  in  u  satisfy 

(1)4)  lijn    ti"-"-  -  0. 

n — ><x) 

Secondly,  we  can  construct  an  example  where  all  the  functions  considered  have 

the  convergence  properties  which  are  required  in  this  section.  Let  R  be  a  free 

associative  ring  with  a  finite  number  of  free  generators.  Let  u  be  any  sequence 

of  elements  of  R  for  which  the  minimum  degrees  li  of  terms  involved  in  u  have 

^     '^n  n 

the  property  (lU) .  '^hen  for  any  choice  of  the  u  , 

A(t)  -  E  %  t"" 

has  all  the  convergence   (or  existence)  properties  needed  in  this  section. 

The  case  where  the  A(t)   are  finite  mativices  with  elements  depending 
on  t  will  be  considered  extensively  in  the  next  section. 

We  can  now  state 

Theorem  III   .  Let  A(t)  be  a  known  function  of  t  in  an  associative 

ring  R^and  let  U(t)  be  an  unknown  function  satisfying 

(15)  S    ■  A^  »  "(°^  -  1- 

Then,  if  certain  unspecified  conditions  of  convergence  are  satisfied,  U(t) 
can  be  written  in  the  form 

(16)  U(t)  -  exp  A  (t) 
where 


dA 
dt 


00 


(17) 


■h±i\-^  <-''': 


A  - 1  [*,n] » ^  [  [A,  a]  n]  -  ^  .  ij  [[[  [A,ii]  njnjn]: 
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The  p   vanish  for  n  -  3,5,7,  ...,  and  ^^   -  (-1)"*'  B^^/{2m) .   ,   where  the  B^^ 

(for  ra«l,2,3. ..)  are  the  Bernoulli  numbers.  Integration  of  (1?)  by  iteration 
leads  to  an  infinite  series  f or  jl  the  first  terms  of  which  (up  to  terms  in- 
volving three  integrations)  are; 

t  t  y, 

A  ufkO:)dx-\f    [A(t),/ A(cr)d(r  j  dr 


J  f   UCt),  f  [A((r),y  A(p)dp]  d<r  J  d 


y 


t  r      '■  "I 

o  L       o  o       -* 

^  •  •  •  • 

Formula  (18)  is  the  continuous  analogue  of  the  Baker-Hausdorff  formula  by  which 
z  is  expressed  in  (11.1$)  as  a  Lie-element  in  terms  of  x,  y.     The  first  terms 
of  the  Baker-Hausdorff  formula  are 

z  =  X  ♦  y  +  I   [x,y]   +  X?     [  ^*^'^  y  ]   '  T?   [  E^^y]  ""  ] 

*i[[[x,y]y]^7fe[[[Cx,y]x]x]x] 

-  ~[[M^>>]    *    iBo  [[tt^'^  J  ^]  ^]  ^] 

-  iBo  [  [[r->^]  -]  y]  ^]  ^    ife[[M-]'  M] 

-  nij  [  [[x,y]  y],  [x,y]  ]  +  ... 


*~fhe  Computation  Laboratory  of  the  National  Bureau  of  Standards  has  provided  a 
method  of  calculating  the  coefficients  of  the  Baker-Hausdorff  formula  on  its 
electronic  computing  machine.     The  method  is  based  on  the  I^nkin-Specht-Wever 
formula  and  on  the  procedure  indicated  in  the  present  paper.  (See  Section  IV). 
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To  derive   (19),  we  can  proceed  in  the  sane  way  by  which  (18)  was  proved.     We 
observe  first  that  according  to  (3)  or  (6): 

(20)  e-^  (u  4-)e'  .    T    (k.z''-^}  I  n!   , 


n»l 


where 


(21)  i:  -  (u  1^)   z 
and 

(22)  Ix..^]   -    [f...[^,z]...J   z]    . 

The  quantity  u  may  be  considered  as  a  new  generator  or  variable .  Now  we  have 
from  (II. 15),  (3),  and  (II. 18): 

(23)  e-^u|^)e^.e-ye-Mu|^)e^ey 

(2U)  =e-y(  2(u,x"-^}/nl)ey  . 

n=l  L    ^ 

Comparing   (20)   and  (2U)  we  have 


(25)  f|^^,z"-H/nl     -e-y|u,  Vlfe^  , 


X 


and  (25)  holds  for  any  u  if  ire  define  X,  by  (21).     Now  we  choose  u  in  such  a 
manner  that  the  right-hand  side  in  (25)  becomes  as  simple   as  possible. 
For  this  purpose  we  try  to  determine  u  in  such  a  way  that 

(26)  [u,  2^ 

This  is  possible  and,   in  accordance  id.th  Lenma  3  (formulas   (8)   and  (9)    )  we 
have 


(27) 

VJith  this  value  of  u,   the  right-hand  side  of  (25)  is  equal  to  y  =  e"^ye^  , 
and  now,  using  Lemma  3  again  in  order  to  express  t,  in  terms  of  y  and  z,  we 
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obtain  from  (25) 


^■(-Ie)^-  {^*;tl}'^"^  &'^]*i2  [[y.^i -]>.... 


(28) 

In  a  similar  manner  we  find 


vhere 

(30)  ^*  .  (v  A.  )  z  . 

If  we  choose  v  «  y  we  find  from  (29),  Lemma  3,  and  (28),  that 

(31)  (y|)..|„-f-y  (,^),. 

Since  z  is  a  Lie-element  and  since  every  Lie-element  in  x,y  contains  both  x 
and  y  if  its  degree  is  at  least  two,  we  may  write  z  in  the  form 

z  «  z^  +  z,  +  z„  +  . . .+  2  +  . . .  , 
o    1    2        n      * 

where  z„  ■  x  and  where,   for  n  ■  1,2,3,    ..,   the  term  z     consists  of  all  the 

terms  in  z  which  are  of  degree  n  with  respect  to  y.     Since  terras  of  the  same 

degree  with  respect  to  y  on  the  two  sides  of  an  equation  must  be  equal  we  find 

from 


(32) 


that 


(y  3^7  )   z  ■  2,  +  22«  +   . . .  +  nz     +   . . . 
cjy  X.  c  n 


(u  ^)    (x  +   2^+    Z2*"*  +   2;     +...) 


(33)  2^     '-'^^k^  -, 

(3U)  \'\^-h-^. 


(35)  z      .  i    (u|-    )2      ,       . 

n       n         dx       n-1 
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The  meaning  of  equat-ions   (33)  to  (3$)   is  that  ve  apply  first  the  operator 
u  ^  and  then  substitute  for  u  the  expression  in  (27).     Apparently,    (35) 
allows  a  computation  of  z  by  recurrence. 

We  can  derive  Iheorem  III  from  Hausdorff's  formula  by  considering 
integrals  as  limiting  cases  of  a  sum.  However,  the  direct  proof  of  (l8)  by 
iterated  integration  of  (l?)   seems  to  be   simpler. 

IV.   Zassenhaus'   Formula 

Let  R  be  the  free  ring  with  two  generators  x,y  and  with  rational 
coefficients.     It  has  been  observed  by   Zassenhaus    [ll3  that  there  must  exist 
a  fonrmla  which  may  be  called  the   dual  of  Hausdorff's  formula.     We  may  state 
his  result  as  follows: 

There  exist  uniquely  determined  Lie-elements  C   (n*  2,3,U,...)  in 
R  which  are  exactly  of  degree  n  in  x,y  such  that 

(1)  e^^y  -  e^  e^  J"^  e^^   .    .    .  e^"   .    .    . 

The  existence  of  a  fonrnila  of  type   (l)   is  an  immediate  consequence  of  Haus- 
dorff's theorem.     But  the  computation  of  C     becomes  rather  difficult  if  it  is 

n 

based  on  Hausdorff's  complicated  formula.     A  simpler  method  for  the  calcula- 
tion of  the  C^  can  be  derived  from  a  result  due  to     Dynkin  [123   ,  Specht  [l3] 
and  Wever   [lli]    .     The  method  employed  here  has  already  been  used  by  Eynkin  to 
derive  the  coefficients  of  the  terms  of  degree  n  in  Hausdorff's  formula  with- 
out the  use  of  the  coefficients  of  terras  of  lower  degree. 

For  every  element  F(x,y)  in  R  we  define  a  corresponding  Lie-element 
-[fj  ,  where  the    "curly  bracket  operator"   [^   has  the  following  properties; 

a.  For  any  element  C  in  the  field  of  coefficients, 

(2)  |^Cf\-   c{f}. 

b.  For  any  two  elements  F,,Fj,  of  R 

(3)  ("l^'O     -    {^l}  ^    {^2}- 


c.  Let  x   ,  for  V  -  1,2,...,  n,  be  any  one  of  the  generators. 


Then 


V 


for  any  monomial  x^  x^...x^  we  define 

(U)  ^x^X2   ...x^^    -     r  [...[  [V2  1x3  ]...]  x^  j    ; 
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and  for  the  identity  we   define 

It  is  clear  that  the  operator  {]  is  defined  uniquely  for  all  F  in  R  by  the 
rules  a,b,c.     Now  the  result  proved  in[l2]    ,  [13]  ,  [llj   is:     Let  G  be  a  homo- 
geneous Lie-element  in  R  which  is  of  degree  n.     Then 

(5)  [g]  -  nG  . 

Ilie  following  result  can  be  derived  easily  from  Wever's  paper  LlU]  J 

Lerana  h-     If  G  is  a  homogeneous  Lie-element  and  F  is  aiiy  element  of 
R,  then 

(6)  £g^f}-o. 

Now  we  expand  both  sides  of  (l)  in  power  series  and  apply  the  opera- 
tor{]. 

According  to  Lemma  h.  we  find 

=  X  +  y 

since  -^  (x  +  y)"y  »  0  if  n  >  1.  In  the  same  manner  we  find  for  the  left-hand 
side  in  (l) 

(8)|e''ey  e  ^  e  ^  ...1  -   |^  x7nl  ^  y7nl  ^  C^  /nl  . . .  j 

y  +  \jy\  +  [C2^+  [xy^j  /2I  ♦  1^x02^  +  ^yCgj 


=  X  + 


+    ... 


where  the  omitted  terms  are  of  a  degree  greater  than  three.     By  comparing 
terms  of  the  sairie   degree   in   (?)   and  (8)  we  find 
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and  therefoi^,  because  of  (5)^ 

C3  -  -  5  [  [x,y]y  ]  +  5  {(x  +  y)(^  -  y^)} 
-  -  |[  [x,y]y  ]  -   I  [  [x,y]x  ]  . 

It  is  clear  that  by  this  method  we  may  also  confute  C  for  any  n  >  3  by 
recurrence  formulas. 
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Second  Part;  Matrices 

V.  Integration  of  Systems  of  Ordinary  Differential  Equations  by- 
Elementary  Formulas 

Let  A(t)  and  T(t)  be  n  by  n  matrices  the  elements  of  which  depend  on 
a  parameter  t.  V!e   consider  the  system  of  linear  homogeneous  differential  equa- 
tions 

W  ^    -  *T 

subject  to  the  initial  conditions 

(2)  l(o)  -  I, 

wher«  I  denotes  the  unit  matrix. 

From  well-known  general  theorems  we  know  that  (l)   always  has  a 
uniquely  determined  solution  Y(t)  which  is  continuous  and  has  a  continuous 
first  derivative  in  any  interval  in  which  A(t)  is  continuous.     The  elements 
of  the  k-th  column  of  Y(t)   are  the   solutions  of  y     of  the  system  of  linear 
differential  equations 

*"  S^'li    %,^^^     '  (v.l,2,...,n) 

subject  to  the  initial  conditions 

(U)  y^(o)   '  0  ±f  V  ji  ky     yj^(o)   -  1. 

The  a    are,  of  course,  the  elements  of  A,  and  they  are  functions  of  t. 

The  determinant  of  Y  is  always  different  from  zero;  its  value  is  given  by 


t  n 
(5)  I  Y  I  -  exp  (  /^  r  a^^  (s)  ds  )  . 

o  v^ 

We  wish  to  apply  Theorem  III  and  in  particular  formula  (III .18)  to 

equation  (l),  assuming  that  Y  can  be  written  in  the  form  Y  «  exp  JCL  .     In 

general,   the  use  of  Theorem  III  involves  difficulties  of  convergence;     some 

of  these  will  be   discussed  in  Sections  VI,   VII.     But  there  is  one  case  in 

which  (III. 18)  clearly  determines  .0.   for  all  values  of  t,  namely  when  the 

series  in  the  right-hand  side  of  (ill .18)  terminates.     This  will  happen, 
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for  instance,  if 

t  t 

(6)  A(t)  (/  A(r)  dt)  -  (/  A(^)d^)  A(t)    =o 

o  o 

identically  for  all  values  of  t.  If  (6)  is  true,  -Tl  becomes  simply 

t 

(7)  /  A(r)dr 

o 

and  Y»expil  satisfies  (l). 

In  order  to  state  a  concise  result,  we  introduce  the  followings 

Definition  of  a  Lie-integral  Functional.  Let  A(t)  be  an  integrable 
function  of  t  (in  the  ordinary  sense).  We  define  a  Lie-integral  functional 
^   of  veight  n  of  A  recursively  as  follows: 

(i)  The  functional  of  weight  1  is  any  multiple  of 

t 

(8)  r   A(s)  ds. 

"'c 

(il)  Let  $  ^,  $  ,  ...,  ^  be  any  functionals  of  weight 

X,  n,  ...,  V  such  that 

(9)  X  +  |i.+  ...v  +  p  =  n-1  . 

Then  a  functional  of  weight  n  is  defined  as  any  linear  combination  of  terms  of 
the  type 


(10) 


/   M...  [[a(s),  $  J9  J--- 


f. 


$  0       ds   , 


where  $^  ^ »  ♦••»  (b   ar©  written  as  functions  of  the  independent  variable  s. 

Apparently,  the  terms  involving  1,  2,  ...,  n  integrations  in  (III. 18) 

are  functionals  of  the  type  described  above}  we  shall  call  them  the  Baker- 

Hausdorff  functionals  B  of  A(t)  (for  c  «  0),  and  we  shall  write  (ill. 18)  in 
n 

the  form 

(11)  XI   =  B^  +  Bg  +  B-  +  .  .  .  , 
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where  the  B  shall  be  written 
n 

(12)  B^(a,  t,  c) 

if  the  matrix  A,   the  variable  t,  and  the  constant  c  are  to  be  exhibited.     In 
(III .18),  we  had  assumed  that  c  =  0.     Now  we  can  state 

Theorem  IV.     If  all  Lie-integral  functionals  of  A  of  a  weight  m 
vanish  (n  <  m  ^  2n-l  ),   then  the  solution  of  (l)  with  initial  conditions 
(2)  is  given  by  Y  "  exp  D.  ,  where 

(13)  ^'    71    K  (A,t,0). 

v-1      ^ 

The  B     are  the  Baker-Hausdorff  functionals  defined  by  (III. 18)   and  (11). 

A  sufficient  (but  not  a  necessary)  condition  for  the  vanishing  of 
Lie-integral  functionals  of  weight  greater  than  n  is  that 

(Hi)  I    [•••    '    &^^l^'  ^(^2^  J  ^^^3^  ]    •••  J^^Vl^ 

for  any  choice  of  s.  , . . . ,  s     ,    . 

Clearly,  the  fact  that  all  functionals  of  type  (10)  vanish  for 
weight  m  between  n  and  2n-l  implies  that  but  a  finite  number  of  linearly 
independent  functionals  must  vanish  identically. 

In  order  to  prove  Theorem  IV  we  need  first: 

Lemma  S'-     If  all  Lie-integral  functionals  of  a  weight  m  vanish, 
■vAere  n  <  m  'S  2n-l  ,  then  all  Lie-integral  functionals  of  any  weight  m  >  n  also 
vanish. 

Proof.     We  shall  consider  a  functional  of  type   (10),   asstiming  now 
that  ra-l+X+n+    ...+p  and  that  m  >  n.     Since  our  Lemma  is  trivial  for 
n  =  1,  we  may  assume   that  n  >  1.     Now  we  shall  apply  induction  with  respect 
to  m,   assuming  that  m  ^  2n.     If  one  of  th3  weights  X,  (ij    •••?  P  is  greater 
than  n,  the  corresponding   ^   vanishes  identically  and  the  Lemma  holds.     But 
suppose  all  of  the  weights  X,  p.,    ...,  p  are  ^  nj     then  we  consider  the  first 
number  S  greater  than  n  in  the   sequence 

1,   1  +  X,   1  +  X  +  [I,...,     1  +  X  +  n  ♦    ...   +  V,   1  +  X  +   ...  +  v+  p. 
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This  number  is  necessarily  at  most  equal  to  2n.     If  S  <  2n,  for  example,  if 
S«l+X+,..+v,     then 

is  the  derivative  of  a  functional  of  depi^ee  S,  *rtiere  n  <  S  <  2n,   and  therefore  not 
only  this  f\inctional  but  also   (10 )  vanishes  identically.     Hence  for  this  case  our 
Lemma  is  tnie.     There  remains  the  case  S  =  2n,  which  can  take  place  only  if  the 
last  term  vinS»l+X+..,+vis  equal  to  n.     Now  we  use  a  Lemma  proved  by 
V'ever  [.lUj  according  to  which  a  Lie-product  (15)  can  also  be  written  as  a  sum  of 
Lie-products  in  which  J       always  appears  in  the  first  place,  but  in  •vrtiich  the 
factors  and  the  arrangement  of  the  brackets  are  the  same  as  in  (15).     This 
Lemma  follows  without  difficulty  from  the  Dynkin-Wever-Specht  formula  (IV. 5). 
Consider  now  any  Lie-product  of  type   (15)  in  which  the  first  factor  is  S     . 
Tae  second  factor  is  either  A  or  one  of  the  other  factors,  which  may  be  called 
Y.     Now  we  merely  have  to  show  that 

(16)  [$^,  a]  .  0,  [^v'"^]  '  °   • 

If  (16)  holds,  any  product  involving  the  left-hand  sides  in  (16)   also  vanishes 

and  therefore  the  product  in  (15)  vanishes.       Now  (16)  is  true  because 

\  &     ,  Aj  »  -    [a,    ^     J   is  the  derivative  of  a  vanishing  functional  of  weight 
n  +  1.     Similarly,    1$^*$     1   "  ^  since 

Both  d^   /dt  and  dT/dt  are  sums  of  terms  of  the  type 

[...[  Ea,!^^]!^]...]    , 

where  T-,,  !„,... are  Lie-integral  functionals.     Therefore,  the  right-hand  side 

of  (17)  vanishes  since  the  individual  terms  are  derivatives  of  Lie-integral 
functionals  of  a  weight  k  (n+1  =  k  =2n-l)  .     %e  inequalities  for  k  follow 
from  the  fact  that  the  weight  of ^      equals  n  and  the  weight  of  1  is  less 
than  n  and  at  least  equal  to  unity. 
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This  finishes  the  proof  of  Lernma  $.     Probably,   a  better  result  could 
be  obtained  for  any  given  n*,     for  example,  for  n«2  it  is  easily  shown  that  the 
vanishing  of  all  functionals  of  weight  3  implies  the  vanishing  of  all  func- 
tionals  of  weight  U.     For  n=l,  it  is  trivial  that  all  fimctionals  of  a  weight 
^  2  vanish  if  those  of  weight  2  vanish. 

To  prove  Theorem  IV  we  observe  that  all  steps  in  the  formal  proof 
of  (III. 18)  now  involve  only  a  finite  number  of  terms.     Firstly,  the   solution 
Sj-    of  (17)  by  iteration  now  gives  a  finite  sum  of  functionals  which  we  denote 
by  B     (v=l,...,n)  in  accordance  with  (ll).     Secondly,   it  follows  directly  that 
the  Lie-product 

(18)       [[...[[|^.-^]n]...]ii] 

of  at  least  m+1  factors  vanishes  identically  of  m  ^  n.     Now  we  derive  from 
(III. 3) 

^)'-^  dJl        1     rdil      m        1     rrd/l 


This  formula  makes  sense  for  any  differentiable  finite  matrix  ^(t)   since  it 
can  be  shown  to  be  an  absolutely  and  vmiformly  convergent  rearrangement  of  the 
series  obtained  by  differentiating  exp(^)  directly  term  by  term  and  multiply- 
ing by  exp(-/l)   afterwards.     From  (18)   it  follows  that  the  right-hand  side  in 
(19)  is  a  terminating  series.     Calling  its  sum  B,  we  find  directly  from  (18), 
(19)   that 


(20,  if.E%a»'^i' 


n-l 

E 

v=0 

where  the  p     are  explained  in  Theorem  III.     On  the  other  hand,  J Z  could  also  have 
been  derived  from  , 

v=»0 

since  the  higher  terms  in  (l?)  do  not  contribute  toiJ  .     Putting  B-A  »  C  we 

mei^iy  have  to  show  that  the  equation 

n-l 


(22)  IIPv  (^'-^"^    -° 

v=0  ^  J 
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cannot  have  a  solution  C  which  does  not  vanish  identically  and  which  is  such  that 
|c,Xl'"V  »  0  if  m  ^  n.  This  can  be  shown  by  applying  bracket  multiplication  by 
to  (22)  k  ■  n-l,n-2,,.,,  1  times.  Then  we  find  that 

(23)  E  U  0,  Ji^l  -  e"'  U  C.Sl^*H.O. 


E^{=.^n-"i:" 

v=0      *^''  -^        v»0 

and  this  gives  recursively 


(2U)  P„|C,  il^-^J     -  p^^C,  Jl°-2}    -    ...  -  p^    [pMl  -  0. 

Since  p  ^  0,  combining  (22)  and  (2U)  we  find  that  C  «  0,  and  this  finishes  the 
proof  of  the  first  part  of  Theorem  IV. 

The  statement  in  Theorera  IV  about  (lU)  being  a  sufficient  condition 
for  (13)  to  be  true  is  almost  trivial.  To  show  that  it  is  not  a  necessary 
condition  we  take  n  »  1  and 


(°x  'y°- 


A(t)  «  (cost  -  cos2t  )  I       )for  0  <  t  ^  2n 

(25) 

'(t-2n)2 

A(t)  -  I  \  for  t  >  2n  . 


(t-2n)5' 


Clearly 


> 


t 

(26)  IMt),  /   A(s)ds  1  -  0 

'o 

but  if  0  <  s,  <  ^  and  if  Sp  >  2ti,  then 

(27)  [a(s^),  A(e2)  J  ¥  0. 

It  remains  an  open  question  whether  (lU)  is  a  necessary  condition 
for  the  vanishing  of  all  Lie-integral  functionals  of  weight  m  >  n  if  A(t)  can 
be  expanded  in  a  power  series  in  t. 
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VI.  Conditions  for  the  Existence  of  a  Solution    Y  «  exp-TI    for  Y  -  AY» 

Ve  consider  again  a  system  of  linear  differential  equations  of  the 
first  order: 

(1)  §    -  A(t)  T(t)    , 

where  Y  and  A  are  n  by  n    matrices  the  elements  of  which  are  functiohs  of  a  para- 
meter t.     We  assume  again  that  Y(o)  is  the  identity  I  and  that  A(t)  is  continu- 
ous in  t,   although  it  is  well  known  that  the  latter  condition  could  be  weakened. 

If  we  wish  to  represent  the  solution  of   (l)  in  the  foria  Y  «  exp  S2  , 
the  technique  based  on  (III. 17)   and  (ill. 18)  will  work  for  sufficiently  small 
values  of    |t|  .    Also,   it  is  well  known  that  any  preassigned  constant  matrix  Y 
can  be  wrttten  in  the  form  exp  /2 ,  if  the   determinant    |Y  |  of  Y  is  different 
from  zero .     From  Section  V  we  know  that  Y  is  finite  and    |Y    |  /  0  everywhere . 
Nevertheless,   if  the  function -O  (t)  is  assumed  to  be  dif ferentiable ,  it  may 
not  exist  everywhere.     This  can  be  shown  by  the  following  considerations. 
■Je  may  assume   that  for  a  fixed  value  of  t  •  t     we  may  start  from  t  «  0  and 
arrive  at  the  value  for  the  solution  of   (l): 

(2)  Y^  -  Y(t^)   -  exp-Q^   ,    -0^  --O(t^). 

Let  us  consider  Y     andfl      as  points  S     and  S     respectively  of  2n  -dimensional 
o  0  y  0) 

space,  where  the  Cartesian  coordinates  'V      and  co     of  these  spaces  consist  of 

the  i*eal  and  imaginary  parts  of  the  n     elements  of  an  arbitrary  matrix  X  or  Si.    . 
The  formula 

(3)  Y  =  expjl 

defines  a  mapping  of  S  into  S  such  that  the  coordinates  in  S  become  entire 
analytic  functions  of  the  coordinates  of  S  .  'ihs   functional  determinant 

•^  CO 

(U)  A     .      I  !|^   I  (v,ii-l,....,2n2) 

of  this  mapping  is  an  analytic  function  of  the  co   .     If  A   does  not  vanish  at 
a  certain  point  -Q.      of  S  ^,  then  a  neighborhood  of  Sj-       is  mapped  continuously 
with  a  one-to-one  correspondence  upon  a  fxxll  neighborhood  of  Y   .     In  this  case, 
the  solution  Y  -  exp  ii  of  (l)  can  be  continued  beyond  the  value  t»t^  to  a  value 
t^  >  t^    .       If,  however, A  -  0  at  Y  =  Y^,   then  dY/dt  «  AY  may  point  towards  a 
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part  of  S     which  is  not  covered  by  the  map  of  S     in  S   .     In  this  case,   dii/dt 
cannot  exist  at-Q  = -'■^    .     Actually,  the  question  reduces  to  the  problem  of 
solving 

SI 


{^■rf- 


with  respect  to  d-Q/dt.     If  this  is  possible  for  any  A  in  the  neighborhood 
(in  S   )  of  a  point/2  "  IJ-  ,   and  if  the  result  is  of  the  type 


CO 


(6)  ^.F(A,a), 

where  the  right-hand  side  in  (6)  is  a  matrix  depending  analytically  on  the 
eleiTientiof -A  ,  then  (l)  has  a  solution  of  type  (3)  in  the  neighborhood  of 
Y     =  expJl      »  expJJ- (t  ).     We  shall  prove  the  following  result: 

Theorem  V.     The  functional  determinant  Z^  defined  by  (U)   does  not 
vanish,  and  (5)  can  be  solved  by  an  expression  (6)   in  the  neighborhood  of 
any  point J^      in  S     for  an  arbitrary  A  if  and  only  if  none  of  the  differences 

between  any  two  of  the  eigenvalues  of _Q      eqpaals  2mni,  where  m  =  1  1,  1  2,... 
m  j<  0. 

Proof:  If  (3)  holds,  the  determinant    |l|   of  Y  is  different  from  zero. 
Putting 

(7)  dY  •  Y"-^  -  dZ  , 

we  may  compute  the   determinant  which  connects  the  elements  of  dZ  and  dil.     It 
will  differ  from  A  only  by  a  power  of    |Y|~     since  the  elements  of  each  row  of 
dZ  are  obtained  from  the  corresponding  row  of  dY  by  a  linear  substitution,   the 
matrix  of  which  is  the  transpose   of  Y"    .     From  (III. 3)  we  have 

(8)  dZ  -  dil     ^Ij.   [dH   ,-Q]    +  ^    [  Ld-0,-OjJ}]    ;    ...    . 

Let 

(9)  dZ  =  (dz       )    ,  dJl  '  (do       )    ,   (v,ii  -  l,...,n) 

and  assume  that 

(10)  -O     =A.    |X^,...,  X^   I 
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is  a  diagonal  matrix  with  tne  numbers  \,,...,X  in  the  main  diagonal.  In  this 

case  we  have  from  (8),(y)  and  (IC): 

exp  (X  -X  ) -1 

(11)  dz^,       =     r^i— tir doi 

^>^  \  -  \  v,n 

2  P 

The  n     quantities  dz         are  linear  functions  of  the  n     quantities  do)       .   and 

the   determinant  of  (ll)  is 

-f_-      exp  (X  -  X  )  -1 

where   the  pi^Dduct  is  extended  over  v,  n  =  1,    ,..,   n,  with  v  ^  y..     Let 

(13)  (x-X^)    (x-X„)    ...(x-X  )   =.  x""-  s,   x""-*-  +s^x""^  +    ...+   (-1)'^  s      , 

±  ^  n  12  n 

where  the   s^   (v=l,...,n)  are   the  elementary  syinmetric  functions  of   the  X   . 

Then  /^     becomes  a  function  of  the   s  .  which  is  analytic  and  entire  in  each  s    . 
Next,  let  ^  V 

(1)4)  SI    -  cAc"^  , 

where  C  is  a  matrix  the  determinant  of  which  equals  unity.     If  we  introduce 

(15)  dz""  =  c"-'-   •  dZ-  C,     djl*  =  C'-^-djl  •  C   , 

equation  (8}   becomes 

(16)  dz*  -  d-a*  -  |r  [dJf,A  I  .  1.  [  [dJl"',AjyVj  T  .-. 

and  instead  of  (ll)  we  have 

,      ,  exp   ( X.,  -  X, , )  -1 

(17)  dZ,  ,,   "  -J- ^ ^ d  CO 

v,ti  X^  -  X^  v,n 


Now 


^"^^^    '  ^^^^^,^^'    ^^^,^x)    =  C   (^%,^)    ' 


■«•         -1  1 

where  C     =  C    ®  C  is  the  Kronecker  product  of  C"-^  and  C,   and  where    (dZ       )jetc., 

2  v,(j. 

stands  for  the  vector  with  n     components  dZ       .       Therefore  the  dz       are  again 
linear  functions  of  the  dco       ,  -vrfiere  the  determinant  of   the  relation  is /\    . 
This  shows  that 

(18)  A*  -     1^1 


-  31  - 

whenever  Ji  can  be  trans  formed  into  diagonal  form.     NowZ^      in  (12)  is  a  func- 
tion of  the   s   ,  which  are  the  coefficients  of  the  characteristic  equation  for 

SI  .  Since  A  must  be  a  continuous  function  of  the  co  ,  let  Zi  bi'written 
as  a  function  of  the  s  .  This  gives  an  expression  for  Z\  "  which  is  valid  if 
_Q    has  different  eigenvalues.     But  in  S    the  neighborhood  of  every  point -/2 

contains  points  corresponding  to  matrices  which  have  different  eigenvalues. 
Therefore  the -^       in  (18)  is  given  by  (12)  for  all_0    .     ihis  proves  i'heorem 
V,   since  A      v.dll  vardsh  if  and  only  if  the  conditions  of  Theorem  V  are  satis- 
fied. 
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VII.     The  Example  of  the  Ordinary  Differential  Equation  of  Second  Order 

The  equation 
(1)  "    y"  +  Q(t)y-0 


may  be  written  as 
(2) 


^1     '^2 


72    •=  -Qy^ 

and  leads  to  the  matrix  equation 


(3)  g-AY, 


where  y,,  ^,    denote  two  linearly  independent  solutions  of  (l).     If  we  choose 
these  in  such  a  way  that 

y^(0)  -  1       y^(0)  -  y^'(O)  -  0 
(U) 

^l^iO)   -  0,^2^0)   -'^{(O)   -  1   , 

then  Y(0)  ■  I  and  the  determinant    |T  |  of  I  equals  unity  for  all  values  of  t. 
Therefore,   if  Y  »  exp  ^    ,   the  trace  ofJl    vanishes  and  we  may  put/i    into 
the  form 

(5)  Jl  -  f 

\I  -co 
If  we  introduce 

(6)  A  -  /i^  +  P  T  , 
we  find  after  some  calculation: 

(7)  Y  -  (sinhA)  j^  ^  (coshA)  I 

A 
Let 

(8)  ©   '  ^1  *  '*ll  • 
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be  the  trace  of  T.  Then  wa  have  from  (7) 

(9)  9  =  2  cosh  A 

(since  the  trace  cfjl   vanishes).     Actually,  2  A    is  the  difference  between  the 
eigenvalues  X.,  Xp  ofj7    •     If  this  difference  is   a  multiple  of  2nibut  differ- 
ent from  zero,  then  ^  ^  ^2  ^^"°®  X^  +  X^  «  0 ,     In  this  case,J^  can  be  trans- 
formed into  the  diagonal  form.       But  then,   although  both  the  eigenvalues  of  I 
equal  +1  or  -1,  it  is  possible  that  Y  is  not  a  multiple  of  the  unit  matrix. 
Indeed,  if  we  combine   (7)   and  (9)  we  find 

A 

(10)  Jl    -  (2Y  -  e  I) 

which  shows  that  for  A   -  2nni  we  have     0     -  Uj  therefore  if  ^  y^  0  and 
is  finite,   2Y  -  ©I  ■  0,     In  order  to  discuss  at  least  one  case  completely 
we  prove 

Theorem  VI.     Let  Q(t)  >  0  for  all  t  >  0.     ITien  a  solution  T  of  (3), 
with  the  initial  condition  Y(0)  =  I,  has  a  representation 

Y(t)   -  exp-0  (t)  ^  _Q.(0)   -  0,     (t  >  0) 

with^a  differentiable^TV  (t),  if  and  only  if 

(11)  (trace  jf  t   h 
for    t  ^  0  . 

Proof:     If  Y  «  +  I,  for  t  »  t^  we  have  y.^^  -  ±  1,  "^^-^  =  1  1, 
y^l  -  0,   VI  ^^  0,  and 


(12)  ^.y^.oil^yl-  Q^^     -0 


(13)  d_9     ,  y_|'  _  ^^^  .  t,^^  .  .  Q©  .  Q./^^  .     T  2Q  . 

dt 
Therefore  if  Q  >  0  then  0^  <  U  for  t  »  t   +  e     ,  if  e  is  positive  and  sufficiently 
small.     By  differentiating  (9),  we  find  A  -  iD,  where  cosD-  1  1,   D'   -  0,   and 

d"  =  ;^  i  0'   is  positive,     i^ierefore,   D  increases.     Now  we  let  t  increase.     We 

t  2  2 

arrive  at  a  point  where  ©«0,©     <Uorata  point  t-j^  where  0     «  Uj     C  has  been 
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increasinEf  in  the  interval  t    •g  t  <  t,  .     The  possibility  that  9  vanishes  is 

I  I  o  -■■  2    ,        ' 

excluded  because  y^Y)^   -  7-,^-,    "  1>   and  if  0   <U,  ©     "0,   then 

ilh)  \7-{ri^  1  -   ly^^gl  <  ^'      yi'^i  -  K"^  1  "  ^i"?!  -  <^?i  <  ^» 

2 

which  involves  a  contradiction.     Therefore,  D  niust  increase  until  0     reaches 

the  value  U  agsdn.     But  since  we   started  from  D  »  0,  we  have  arrived  at  a 
value  ofD-nn,n>0,  and  therefore,  if -fl   shall  exist,   Y  must  be   the  iden- 
tity.     This  proves  that  the  condition  0     ^  U  is  a  necessary  one.     But  it  is 
also  sufficient,     he  prove  this  as  follows;  If  0     ■  U  for  t  >  0  we  have 

de/dt  =  0  if  e  -  +  2,  and  therefore: 

(15)  7-^^^^'  12,     y^^-Q^i-O,     y^^^'  -  y^''^  ^  -  1. 

But  then  we  have    lyT '^i  I   =  ^>  ^'^  ^^ora  (lU)  we  see  that  y,^  i  '  ^>  ^^i  '  °» 
which  implies  '^  ,   »  0.     Then  y,   ■  0  and  from  this  we  have  y,    =»'W^   =  i  1.     There- 
fore Y  «  I  whenever  coshZ^"  1  1,  A  /  0.     ^hen-fl  is  completely  determined  for 
sinh  A  /  0  by  (10)   and  (?)   and  for  sinh  A  =  0,A  /  0  by  the  result  obtained 
from  differentiating   (10),  namely 

(16)  A^'  »  ( A  '  coshA)/!    . 

By  substituting  the  explicit  expression  (5)   for-Q.  and  the  product  AY    for  Y' 
we  find  from  (16)   and  (6) 

(17)  ^A=A,  yA  =  -qA,    0)  .  0,  gfT  =  A*^, 

whereA  =  nni,  n  ■  1  1,  1  2,  ...  From  (1?)  we  can  derive 


(18)  A  -  -  /-Q  , 

where  the  sign  of  the  square  root  in  (18)  is  determined  by  the  postulate  of  the 
differentiability  of  A  .  Now  SI.  is  completely  and  uniquely  determined  every- 
where, and  this  completes  the  proof  of  Theorem  VI. 
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